Collaborating Authors: mturk worker




Incorporating Worker Perspectives into MTurk Annotation Practices for NLP

Huang, Olivia, Fleisig, Eve, Klein, Dan

arXiv.org Artificial Intelligence

Current practices regarding data collection for natural language processing on Amazon Mechanical Turk (MTurk) often rely on a combination of studies on data quality and heuristics shared among NLP researchers. However, without considering the perspectives of MTurk workers, these approaches are susceptible to issues regarding workers' rights and poor response quality. We conducted a critical literature review and a survey of MTurk workers aimed at addressing open questions regarding best practices for fair payment, worker privacy, data quality, and considering worker incentives. We found that worker preferences are often at odds with received wisdom among NLP researchers. Surveyed workers preferred reliable, reasonable payments over uncertain, very high payments; reported frequently lying on demographic questions; and expressed frustration at having work rejected with no explanation. We also found that workers view some quality control methods, such as requiring minimum response times or Master's qualifications, as biased and largely ineffective. Based on the survey results, we provide recommendations on how future NLP studies may better account for MTurk workers' experiences in order to respect workers' rights and improve data quality.


Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization

Zhang, Lining, Mille, Simon, Hou, Yufang, Deutsch, Daniel, Clark, Elizabeth, Liu, Yixin, Mahamood, Saad, Gehrmann, Sebastian, Clinciu, Miruna, Chandu, Khyathi, Sedoc, João

arXiv.org Artificial Intelligence

To prevent the costly and inefficient use of resources on low-quality annotations, we want a method for creating a pool of dependable annotators who can effectively complete difficult tasks, such as evaluating automatic summarization. Thus, we investigate the recruitment of high-quality Amazon Mechanical Turk workers via a two-step pipeline. We show that we can successfully filter out subpar workers before they carry out the evaluations and obtain high-agreement annotations with similar constraints on resources. Although our workers demonstrate strong consensus among themselves and with CloudResearch workers, their alignment with expert judgments on a subset of the data is weaker than expected, indicating that further training on correctness is needed. Nevertheless, this paper serves as a guide to best practices for recruiting qualified annotators for other challenging annotation tasks.
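The two-step pipeline described above maps naturally onto MTurk's qualification system; the sketch below is a minimal illustration of that pattern using boto3, not the authors' exact setup (the qualification name, worker IDs, and threshold are hypothetical):

```python
import boto3

# Sandbox endpoint for safe testing; drop endpoint_url for production.
mturk = boto3.client(
    "mturk",
    region_name="us-east-1",
    endpoint_url="https://mturk-requester-sandbox.us-east-1.amazonaws.com",
)

# Step 1: create a custom qualification marking workers who passed the
# screening round (name and description are hypothetical).
qual = mturk.create_qualification_type(
    Name="passed-summarization-screening",
    Description="Agreed with reference judgments on the screening task",
    QualificationTypeStatus="Active",
)
qual_id = qual["QualificationType"]["QualificationTypeId"]

# Grant the qualification to each worker who met the screening bar.
for worker_id in ["A1EXAMPLE", "A2EXAMPLE"]:  # hypothetical worker IDs
    mturk.associate_qualification_with_worker(
        QualificationTypeId=qual_id,
        WorkerId=worker_id,
        IntegerValue=1,
        SendNotification=False,
    )

# Step 2: attach the qualification as a requirement on the main
# evaluation HIT so that only screened workers can accept it.
qualification_requirements = [{
    "QualificationTypeId": qual_id,
    "Comparator": "EqualTo",
    "IntegerValues": [1],
}]
# ...passed as QualificationRequirements=qualification_requirements
# to mturk.create_hit(...) when publishing the evaluation task.
```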


Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering

Sen, Priyanka, Aji, Alham Fikri, Saffari, Amir

arXiv.org Artificial Intelligence

We introduce Mintaka, a complex, natural, and multilingual dataset designed for experimenting with end-to-end question-answering models. Mintaka is composed of 20,000 question-answer pairs collected in English, annotated with Wikidata entities, and translated into Arabic, French, German, Hindi, Italian, Japanese, Portuguese, and Spanish for a total of 180,000 samples. Mintaka includes 8 types of complex questions, including superlative, intersection, and multi-hop questions, which were naturally elicited from crowd workers. We run baselines over Mintaka, the best of which achieves 38% hits@1 in English and 31% hits@1 multilingually, showing that existing models have room for improvement. We release Mintaka at https://github.com/amazon-research/mintaka.
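For readers unfamiliar with the metric, hits@1 is simply the fraction of questions whose top-ranked answer matches the gold answer; a minimal sketch (the answer strings below are illustrative, not Mintaka's actual schema):

```python
def hits_at_1(top_predictions, gold_answers):
    """Fraction of questions where the model's top-ranked answer
    equals the gold answer."""
    assert len(top_predictions) == len(gold_answers)
    correct = sum(p == g for p, g in zip(top_predictions, gold_answers))
    return correct / len(gold_answers)

# A model scoring 38% hits@1 gets the top answer right on 38 of 100 questions.
print(hits_at_1(["Q42", "Q64", "Q90"], ["Q42", "Q7", "Q90"]))  # ~0.667
```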


Inside the 1TB ImageNet data set used to train the world's AI: Nude kids, drunken frat parties, porno stars, and more

#artificialintelligence

Special report ImageNet – a data set used to train AI systems around the world – contains photos of naked children, families on the beach, college parties, porn actresses, and more, scraped from the web to train computers without those individuals' explicit consent. The library consists of 14 million images, each placed into categories describing what's pictured in the scene. This pairing of information – images and labels – is used to teach artificially intelligent applications to recognize things and people caught on camera. The database has been downloaded by boffins, engineers, and academics to train hundreds if not thousands of neural networks to identify stuff in photos – from assault rifles and aprons to magpies and minibuses to zebras and zucchinis, and everything in between. In 2012, the data set was used to build AlexNet, heralded as a breakthrough in deep learning because it marked the first time a neural network outperformed traditional computational methods at object recognition accuracy.


A Study on Agreement in PICO Span Annotations

Lee, Grace E., Sun, Aixin

arXiv.org Artificial Intelligence

In evidence-based medicine, relevance of medical literature is determined by predefined relevance conditions. The conditions are defined based on PICO elements, namely, Patient, Intervention, Comparator, and Outcome. Hence, PICO annotations in medical literature are essential for automatic relevant document filtering. However, defining boundaries of text spans for PICO elements is not straightforward. In this paper, we study the agreement of PICO annotations made by multiple human annotators, including both experts and non-experts. Agreement is estimated by a standard span agreement (i.e., matching both labels and boundaries of text spans) and two types of relaxed span agreement (i.e., matching labels without requiring matching span boundaries). Based on the analysis, we report two observations: (i) boundaries of PICO span annotations by individual human annotators are very diverse; (ii) despite the disagreement in span boundaries, the general areas of the spans are broadly agreed upon by annotators. Our results suggest that applying a standard agreement measure alone may underestimate the agreement on PICO spans, and that adopting both standard and relaxed agreement measures is more suitable for PICO span evaluation.
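To make the two criteria concrete, the sketch below contrasts standard (exact-boundary) and relaxed span matching between two annotators; it assumes an overlap-based relaxation, which may differ in detail from the paper's definitions:

```python
def standard_match(a, b):
    """Standard agreement: labels and exact boundaries must both match."""
    return (a["label"] == b["label"]
            and (a["start"], a["end"]) == (b["start"], b["end"]))

def relaxed_match(a, b):
    """Relaxed agreement (assumed overlap-based): labels match and the
    spans overlap, without requiring identical boundaries."""
    return (a["label"] == b["label"]
            and a["start"] < b["end"] and b["start"] < a["end"])

# Two annotators mark the same Intervention with different boundaries:
span_a = {"label": "Intervention", "start": 10, "end": 25}
span_b = {"label": "Intervention", "start": 12, "end": 30}
print(standard_match(span_a, span_b))  # False: boundaries differ
print(relaxed_match(span_a, span_b))   # True: same label, overlapping area
```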


Use Amazon Mechanical Turk with Amazon SageMaker for supervised learning (Amazon Web Services)

#artificialintelligence

Supervised learning needs labels, or annotations, that tell the algorithm what the right answers are during the training phase of your project. In fact, many of the examples of using MXNet, TensorFlow, and PyTorch start with annotated data sets you can use to explore the various features of those frameworks. Unfortunately, when you move from the examples to a real application, it's much less common to have a fully annotated set of data at your fingertips. This tutorial shows how you can use Amazon Mechanical Turk (MTurk) from within your Amazon SageMaker notebook to get annotations for your data set and use them for training. TensorFlow provides an example that uses an Estimator with a neural network classifier to classify irises.
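The same requester API the tutorial relies on is reachable from a SageMaker notebook through boto3; here is a condensed, hypothetical sketch of publishing an annotation HIT and collecting the answers for training (the form URL and task details are placeholders, not the tutorial's exact code):

```python
import boto3

# MTurk sandbox endpoint for testing; drop endpoint_url for production.
mturk = boto3.client(
    "mturk",
    region_name="us-east-1",
    endpoint_url="https://mturk-requester-sandbox.us-east-1.amazonaws.com",
)

# ExternalQuestion pointing at a placeholder annotation form.
question_xml = """
<ExternalQuestion xmlns="http://mechanicalturk.amazonaws.com/AWSMechanicalTurkDataSchemas/2006-07-14/ExternalQuestion.xsd">
  <ExternalURL>https://example.com/label-iris.html</ExternalURL>
  <FrameHeight>600</FrameHeight>
</ExternalQuestion>"""

hit = mturk.create_hit(
    Title="Label an iris flower",
    Description="Pick the species that best matches the measurements shown",
    Reward="0.05",
    MaxAssignments=3,                 # redundant labels for majority voting
    LifetimeInSeconds=86400,
    AssignmentDurationInSeconds=300,
    Question=question_xml,
)

# Later: pull submitted answers and fold them into the training set.
assignments = mturk.list_assignments_for_hit(
    HITId=hit["HIT"]["HITId"],
    AssignmentStatuses=["Submitted", "Approved"],
)
for a in assignments["Assignments"]:
    print(a["WorkerId"], a["Answer"])  # Answer is an XML payload to parse
```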


How YouTube Uses Mechanical Turk Tasks to Help Train Its AI

WIRED

It's no secret that YouTube has struggled to moderate the videos on its platform over the past year. The company has faced repeated scandals over its inability to rid itself of inappropriate and disturbing content, including some videos aimed at children. Often missing from the discussion over YouTube's shortcomings, though, are the employees directly tasked with removing things like porn and graphic violence, as well as the contractors who help train AI to detect unwelcome uploads. But a Mechanical Turk task shared with WIRED appears to provide a glimpse into what training one of YouTube's machine learning tools looks like at the ground level. MTurk is an Amazon-owned marketplace where corporations and academic researchers pay individual contractors to perform micro-sized services--called Human Intelligence Tasks--in exchange for a small sum, usually less than a dollar.